Abstract

Background: The counterfactual approach provides a clear and coherent framework to think about a variety of 
important concepts related to causation. Meanwhile, directed acyclic graphs have been used as causal diagrams in 
epidemiologic research to visually summarize hypothetical relations among variables of interest, providing a clear 
understanding of underlying causal structures of bias and effect modification. In this study, the authors aim to 
further clarify the concepts of bias (confounding bias and selection bias) and effect modification in the 
counterfactual framework. 

Methods: The authors show how theoretical data frequencies can be described by using unobservable response 
types both in observational studies and in randomized controlled trials. By using the descriptions of data 
frequencies, the authors show epidemiologic measures in terms of response types, demonstrating significant 
distinctions between association measures and effect measures. These descriptions also demonstrate sufficient 
conditions to estimate effect measures in observational studies. To illustrate the ideas, the authors show how 
directed acyclic graphs can be extended by integrating response types and observed variables. 

Results: This study shows a hitherto unrecognized sufficient condition to estimate effect measures in observational 
studies by adjusting for confounding bias. The present findings would provide a further understanding of the 
assumption of conditional exchangeability, clarifying the link between the assumptions for making causal inferences 
in observational studies and the counterfactual approach. The extension of directed acyclic graphs using response 
types maintains the integrity of the original directed acyclic graphs, which allows one to understand the underlying 
causal structure discussed in this study. 

Conclusions: The present findings highlight that analytic adjustment for confounders in observational studies has 
consequences quite different from those of physical control in randomized controlled trials. In particular, the 
present findings would be of great use when demonstrating the inherent distinctions between observational 
studies and randomized controlled trials. 

Keywords: Bias, Causal inference, Counterfactual, Directed acyclic graphs, Effect modification, Exchangeability,
Randomization, Response types



Background 

The counterfactual approach provides a clear and coherent 
framework to think about a variety of important concepts 
related to causation [1,2]. In particular, the counterfactual 
approach to confounding has been widely accessible to epi- 
demiologists since the publication of a classic methods 
paper by Greenland and Robins [3], and the concept of bias 
is now explained in the counterfactual framework [4-12].
(Note that an update of the classic methods paper was re- 
cently published [13]). Meanwhile, directed acyclic graphs 
(DAGs) have long been used as causal diagrams in epidemi- 
ologic research to visually summarize hypothetical relations 
among variables of interest [14,15]. DAGs have been used 
extensively to determine the variables for which it is neces- 
sary to control for confounding bias to estimate causal 
effects [14-20]. Besides, Hernan et al. [21] showed that vari- 
ous types of selection bias share a common underlying 
causal structure, and referred to conditioning on common 
effects as selection bias. Furthermore, VanderWeele and
Robins [22] provided a structural classification of effect 
modification by using DAGs. Indeed, the different ap- 
proaches provide complementary perspectives, and can be 
employed together to provide a clearer understanding of 
causality [23]. 

In this study, we aim to further clarify the concepts of 
bias (confounding bias and selection bias) and effect modi- 
fication in the counterfactual framework. To achieve this, 
we show how theoretical data frequencies can be described 
by using unobservable response types both in observational 
studies and in randomized controlled trials. These des- 
criptions also demonstrate sufficient conditions to estimate 
effect measures in observational studies, which would pro- 
vide a further understanding of the assumption of con- 
ditional exchangeability. To illustrate the ideas, DAGs are 
employed, and we show how one can extend the original 
DAGs by integrating response types and observed variables. 
We deal only with structural (systematic) relations among 
the underlying variables of interest, so that an issue of ran- 
dom variation does not arise. Throughout this article, we 
assume that the consistency condition is met [24-28]. 

Methods 

Definitions and notation 

A causal diagram and causal effects 

We use a total of 4 binary variables as shown in Figure 1. 
We let D denote a binary outcome of interest (1: outcome 
occurred, 0: outcome did not occur) and let E denote a 
binary cause of interest (1: exposed, 0: unexposed) that is 
potentially manipulable. We let C denote a binary common 
cause of E and D (1: present, 0: absent), which is also po- 
tentially manipulable. Typically, C is called a confounder of 
the effect of E on D. (Note that we assume that C precedes 
E temporally in this study, which is in general not necessary
for C to be a confounder. Recently, VanderWeele and
Shpitser [29,30] further discussed the definition of a confounder.)
As explained later, C can also act as a direct effect
modifier for the causal effect of E on D because C is a direct
cause of D [22]. Finally, we let S denote a selection variable
(1: selected, 0: not selected), which is a common effect
of E and D. Adjustment for S yields a spurious association
between E and D, which is called selection bias [21].
Alternatively, one may assume that S is also directly influenced
by C, as shown by using a dashed arrow in Figure 1. Al- 
though the arrow is assumed to be absent throughout this 
paper to avoid technical complications, the following dis- 
cussion can be readily extended to the situations in which 
the dashed arrow is present. 

Figure 1 A causal diagram depicting a hypothetical example.
[Diagram: C → E, C → D, E → D, E → S, D → S; dashed arrow C → S.]
E, D, C, and S denote exposure, disease, confounder, and selection
variable, respectively. C may act as a direct effect modifier
simultaneously. In this paper, S is assumed not to be influenced
directly by C (i.e., the dashed arrow is absent). The present
discussion, however, can be readily extended to the situations in
which one assumes that the dashed arrow is present.

Throughout this paper, we discuss the case where the
causal effect of primary interest is the effect of E on D in
the total population, including the subpopulations of C = 0
and C = 1. In the counterfactual framework, this causal
effect is given by comparing $P[D_{e=0} = 1]$ and $P[D_{e=1} = 1]$,
where $P[D_e = 1]$ denotes the proportion of subjects that
would have developed the outcome D = 1 had all subjects
in the total population received exposure value e. The
variables $D_{e=0}$ and $D_{e=1}$ are referred to as potential outcomes.
Therefore, when we use risk ratios (RRs) as a measure of
interest, a causal RR in the total population is given by

$$\frac{P[D_{e=1} = 1]}{P[D_{e=0} = 1]}.$$

Causal RRs in the total population can be consistently estimated
under the assumption of (conditional) exchangeability, or,
equivalently, no unmeasured confounding (i.e.,
$E \perp\!\!\!\perp D_e$ for $\forall e$). Furthermore, in addition to the effect of
E on D in the total population, we also discuss the causal effect
of E on D within the subsets of C. In this case, a causal RR
within the subsets is given by

$$\frac{P[D_{e=1} = 1 \mid C = c]}{P[D_{e=0} = 1 \mid C = c]}.$$

Causal RRs in the subsets of C can be consistently estimated
under the assumption of conditional exchangeability,
or, equivalently, no unmeasured confounding given data on
C (i.e., $E \perp\!\!\!\perp D_e \mid C$ for $\forall e$). Note that, when the causal effect of
interest is the effect of E on D either in the total population
or in the subsets of C, intervening on E is of concern, and
one does not consider intervening on C. Indeed, as outlined
by VanderWeele [31], intervening on C would only be of
concern if the joint effect of E and C on D was of interest.
Therefore, under the situation in which C is being considered
as a potential confounder as well as direct effect modifier
(Figure 1), intervening on C is not of interest.

When we show how theoretical data frequencies can 
be described by using unobservable response types in 
observational studies, however, it is of great use to eluci- 
date the relations between C and E in the counterfactual 
framework. By so doing, we demonstrate sufficient con- 
ditions to estimate effect measures in observational stud- 
ies, which would provide a further understanding of the 
assumption of conditional exchangeability. 

Response types 

First, we elucidate the relations between C and E by hypothetically
conceptualizing potential outcomes of E in the
counterfactual framework. We let $E_c(\omega)$ denote the potential
outcomes of E for individual $\omega$ if, possibly contrary to
fact, there had been interventions to set C to c. (In the following
sections, we explicitly show $\omega$ to discuss response
types.) Then, for each individual $\omega$, there would be 2 relevant
potential outcomes of E, i.e., $E_1(\omega)$ and $E_0(\omega)$, which
correspond to exposure status of that individual when C is
present and absent, respectively. As a result, individuals
can be classified into 4 (i.e., $2^2$) different E response types
as enumerated in Table 1. We let $E^T(\omega)$ denote E response
type of individual $\omega$. In some cases, the effect of C may be
in the same direction for all individuals in the population.
We say that C has a positive monotonic effect on E if
$E_c(\omega)$ is non-decreasing in c for all individuals, i.e.,
$E_1(\omega) \ge E_0(\omega)$ for $\forall\omega$, which excludes E response type 3. Note that
this should be clearly distinguished from the assumptions
of no preventive action or no preventive sequence [32,33].

In a similar manner, we let $D_{ce}(\omega)$ denote the potential
outcomes of D for individual $\omega$ if, possibly contrary to fact,
there had been interventions to set C to c and to set E to
e. For each individual $\omega$, there would thus be 4 possible
potential outcomes $D_{11}(\omega)$, $D_{01}(\omega)$, $D_{10}(\omega)$, and $D_{00}(\omega)$,
resulting in 16 (i.e., $2^4$) different D response types as enumerated
in Table 2 [34]. We let $D^T(\omega)$ denote D response
type of individual $\omega$. We say that C and E have positive
monotonic effects on D if $D_{ce}(\omega)$ is non-decreasing in c
and e for all individuals, i.e., $D_{ce}(\omega) \ge D_{c'e'}(\omega)$ for $\forall\omega$
whenever $c \ge c'$ and $e \ge e'$. Under this assumption, the
individuals of D response types 3, 5, 7, and 9 through 15 are
excluded; and individuals of D response types 1, 2, 4, 6, 8,
and 16 may remain [32].

Table 1 Enumeration of 4 response types for exposure E
and corresponding potential outcomes

E type            Potential outcomes of E
$E^T(\omega)$     $E_1(\omega)$     $E_0(\omega)$
1                 1                 1
2                 1                 0
3^a               0                 1
4                 0                 0

^a Under the assumption of positive monotonicity (i.e., $E_1(\omega) \ge E_0(\omega)$ for $\forall\omega$), this
response type is excluded.
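The exclusions stated above and in the footnotes of Table 2 can be checked mechanically. The following is a minimal sketch, not part of the original analysis; it assumes that D response types are numbered in the order of Table 2, with each type written as the tuple ($D_{11}$, $D_{01}$, $D_{10}$, $D_{00}$). The names `D_TYPES`, `monotonic`, and `surviving` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Assumption: type j (1..16) is (D11, D01, D10, D00) in the row order of Table 2,
# i.e. descending binary order from (1, 1, 1, 1) to (0, 0, 0, 0).
D_TYPES = {j + 1: bits for j, bits in enumerate(product((1, 0), repeat=4))}

def monotonic(d11, d01, d10, d00):
    # D_ce is non-decreasing in c and in e.
    return d11 >= d01 >= d00 and d11 >= d10 >= d00

def no_additive_interaction(d11, d01, d10, d00):
    return d11 - d01 - d10 + d00 == 0

def no_multiplicative_interaction(d11, d01, d10, d00):
    return d11 * d00 == d01 * d10

def surviving(*conditions):
    # D response types that satisfy every supplied condition.
    return [j for j, bits in D_TYPES.items() if all(cond(*bits) for cond in conditions)]

print(surviving(monotonic))
print(surviving(monotonic, no_additive_interaction))
```

Under these assumptions, the first call lists the types that remain under positive monotonicity and the second lists those that also satisfy no additive interaction at the individual level.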

Likewise, we let $S_{ed}(\omega)$ denote the potential outcomes of
S for individual $\omega$ if, possibly contrary to fact, there had
been interventions to set E to e and to set D to d. For each
individual $\omega$, there would thus be 4 possible potential outcomes
$S_{11}(\omega)$, $S_{01}(\omega)$, $S_{10}(\omega)$, and $S_{00}(\omega)$, resulting in 16
(i.e., $2^4$) different S response types as enumerated in Table 3.
We let $S^T(\omega)$ denote S response type of individual $\omega$.

Finally, we integrate information about the potential outcomes
discussed above to produce 2 types of compound
potential outcomes, which are also called nested counterfactuals
[2]. (Note that compound potential outcomes have
been extensively used in the issues of mediation and direct/
indirect effects [35-38].) First, we combine the potential
outcomes of E and the potential outcomes of D to define
$D_{cE_{c'}}(\omega)$. In other words, the compound potential outcomes
of D are defined by (i) confounder status ($C(\omega) = 1$,
$C(\omega) = 0$) and (ii) potential exposure status following an
intervention on confounder ($E_1(\omega)$, $E_0(\omega)$). For each individual
$\omega$, there would thus be 4 possible compound potential
outcomes $D_{1E_1}(\omega)$, $D_{1E_0}(\omega)$, $D_{0E_1}(\omega)$, and $D_{0E_0}(\omega)$. Second,
we combine the potential outcomes of E, the potential outcomes
of D, and the potential outcomes of S to define
$S_{E_c D_{c'E_{c''}}}(\omega)$. Note that the compound potential outcomes
of S are defined by (i) potential exposure status following
an intervention on confounder ($E_1(\omega)$, $E_0(\omega)$) and (ii) the
compound potential outcomes of D ($D_{1E_1}(\omega)$, $D_{1E_0}(\omega)$,
$D_{0E_1}(\omega)$, and $D_{0E_0}(\omega)$). Thus, for each individual $\omega$, there
would be 8 possible compound potential outcomes
$S_{E_1 D_{1E_1}}(\omega)$, $S_{E_1 D_{1E_0}}(\omega)$, $S_{E_1 D_{0E_1}}(\omega)$, $S_{E_1 D_{0E_0}}(\omega)$,
$S_{E_0 D_{1E_1}}(\omega)$, $S_{E_0 D_{1E_0}}(\omega)$, $S_{E_0 D_{0E_1}}(\omega)$, and $S_{E_0 D_{0E_0}}(\omega)$.

Table 2 Enumeration of 16 response types for outcome D
and corresponding potential outcomes

D type            Potential outcomes of D
$D^T(\omega)$     $D_{11}(\omega)$  $D_{01}(\omega)$  $D_{10}(\omega)$  $D_{00}(\omega)$
1                 1                 1                 1                 1
2^b,c             1                 1                 1                 0
3^a,b,c           1                 1                 0                 1
4                 1                 1                 0                 0
5^a,b,c           1                 0                 1                 1
6                 1                 0                 1                 0
7^a,b,c           1                 0                 0                 1
8^b               1                 0                 0                 0
9^a,b,c           0                 1                 1                 1
10^a,b,c          0                 1                 1                 0
11^a              0                 1                 0                 1
12^a,b            0                 1                 0                 0
13^a              0                 0                 1                 1
14^a,b            0                 0                 1                 0
15^a,b            0                 0                 0                 1
16                0                 0                 0                 0

^a Under the assumption of positive monotonicity (i.e., $D_{ce}(\omega) \ge D_{c'e'}(\omega)$ for $\forall\omega$
whenever $c \ge c'$ and $e \ge e'$), these response types are excluded.
^b Given no interaction at the individual level on the additive scale between C
and E in the counterfactual framework (i.e., $D_{11}(\omega) - D_{01}(\omega) - D_{10}(\omega) + D_{00}(\omega) = 0$
for $\forall\omega$), these response types are excluded.
^c Given no interaction at the individual level on the multiplicative scale
between C and E in the counterfactual framework (i.e., $D_{11}(\omega) D_{00}(\omega) = D_{01}(\omega) D_{10}(\omega)$
for $\forall\omega$), these response types are excluded.

Table 3 Enumeration of 16 response types for selection
variable S and corresponding potential outcomes

S type            Potential outcomes of S
$S^T(\omega)$     $S_{11}(\omega)$  $S_{01}(\omega)$  $S_{10}(\omega)$  $S_{00}(\omega)$
1                 1                 1                 1                 1
2^b,c             1                 1                 1                 0
3^a,b,c           1                 1                 0                 1
4                 1                 1                 0                 0
5^a,b,c           1                 0                 1                 1
6                 1                 0                 1                 0
7^a,b,c           1                 0                 0                 1
8^b               1                 0                 0                 0
9^a,b,c           0                 1                 1                 1
10^a,b,c          0                 1                 1                 0
11^a              0                 1                 0                 1
12^a,b            0                 1                 0                 0
13^a              0                 0                 1                 1
14^a,b            0                 0                 1                 0
15^a,b            0                 0                 0                 1
16                0                 0                 0                 0

^a Under the assumption of positive monotonicity (i.e., $S_{ed}(\omega) \ge S_{e'd'}(\omega)$ for $\forall\omega$
whenever $e \ge e'$ and $d \ge d'$), these response types are excluded.
^b Given no interaction at the individual level on the additive scale between E
and D in the counterfactual framework (i.e., $S_{11}(\omega) - S_{01}(\omega) - S_{10}(\omega) + S_{00}(\omega) = 0$
for $\forall\omega$), these response types are excluded.
^c Given no interaction at the individual level on the multiplicative scale
between E and D in the counterfactual framework (i.e., $S_{11}(\omega) S_{00}(\omega) = S_{01}(\omega) S_{10}(\omega)$
for $\forall\omega$), these response types are excluded.

Combination of 4 E response types, 16 D response
types, and 16 S response types yields 1,024 (i.e., $4 \times 16 \times 16$)
EDS response types. As noted above, under the
assumption of positive monotonic effect of C on E, the
number of possible E response types is reduced from 4
to 3. Further, under the assumptions of both positive
monotonic effects of C and E on D and no interaction at
the individual level on the additive scale between C and
E on D, the number of possible D response types is reduced
from 16 to 4 (see footnote of Table 2). Analogous
argument applies to S response types (see footnote of
Table 3). Consequently, the number of possible EDS response
types is reduced from 1,024 to 48 (i.e., $3 \times 4 \times 4$).
In Table 4, we show a complete enumeration of these 48
EDS response types. To enhance readability, Table 4
shows only selection status when C = 1 (i.e., $S_{E_1 D_{1E_1}}(\omega)$)
and when C = 0 (i.e., $S_{E_0 D_{0E_0}}(\omega)$) among $S_{E_c D_{c'E_{c''}}}(\omega)$. Note
that we made these restrictive assumptions to show the
correspondence between E response types, D response
types, and S response types in Table 4, which would be
of great help to understand the present findings. The following
discussion, however, applies even without these
assumptions. Thus, in the following sections, we use a
total of 1,024 EDS response types, considering general
cases in which these assumptions are not met.
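The counts of 1,024 and 48 can be reproduced by direct enumeration. The sketch below is illustrative only; it assumes the type numbering of Tables 1 to 3 (potential outcomes listed in descending binary order), and the names `E_TYPES`, `DS_TYPES`, `e_allowed`, and `ds_allowed` are hypothetical.

```python
from itertools import product

# Assumption: numbering follows Tables 1-3.
E_TYPES = {i + 1: b for i, b in enumerate(product((1, 0), repeat=2))}   # (X_1, X_0)
DS_TYPES = {j + 1: b for j, b in enumerate(product((1, 0), repeat=4))}  # (X_11, X_01, X_10, X_00)

def e_allowed(e1, e0):
    # Positive monotonic effect of C on E.
    return e1 >= e0

def ds_allowed(x11, x01, x10, x00):
    # Positive monotonicity in both arguments plus no additive interaction
    # at the individual level; used for D types and for S types alike.
    is_monotonic = x11 >= x01 >= x00 and x11 >= x10 >= x00
    return is_monotonic and (x11 - x01 - x10 + x00 == 0)

e_kept = [i for i, b in E_TYPES.items() if e_allowed(*b)]
ds_kept = [j for j, b in DS_TYPES.items() if ds_allowed(*b)]

n_unrestricted = len(E_TYPES) * len(DS_TYPES) ** 2
eds_restricted = list(product(e_kept, ds_kept, ds_kept))  # (E type, D type, S type)

print(n_unrestricted, len(eds_restricted))
```

The list `eds_restricted` follows the row order used in Table 4, running over E types first, then D types, then S types.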

Four hypothetical situations 

In Figure 2, we give an overview of 4 hypothetical situations 
by using DAGs. Figure 2A describes a situation in which researchers
conduct an observational study and the information
about a portion of subjects is unavailable due to loss to
follow-up. Note that the square around S indicates that the
analysis is restricted to those who do not drop out (i.e., 
S = 1). Investigators often encounter this situation in obser- 
vational studies. Researchers should be concerned about 
both confounding bias and selection bias in this situation. 

Subsequently, Figure 2B shows a situation in which re- 
searchers can obtain the information about the total 
population, including those who dropped out. In this 
situation, a possibility of selection bias can be ruled out 
since researchers do not condition on S. 

In observational studies, researchers usually aim to 
eliminate confounding bias by employing some statistical 
procedures, e.g., standardization and inverse-probability 
weighting method. In other words, they aim to analy- 
tically block or remove the path between C and E by 
making an adequate adjustment. (Note that outcome 
modeling techniques such as disease risk scores focus on 
the path between C and D [39].) By contrast, in random- 
ized controlled trials, researchers manipulate the value 
of E by employing certain interventions; they physically 
prevent E from varying in response to variations in C. 
Thus, as shown in Figure 2C and D, C would no longer 
have effects on E, and the arrow from C to E is erased
or removed [14]. This should be clearly distinguished 
from analytic control of C in observational studies. 
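As a small illustration of the analytic adjustment mentioned above, the following sketch applies standardization over the strata of C. It is a hedged example rather than the authors' procedure: it assumes conditional exchangeability given C, and the stratum-specific risks and the distribution of C are hypothetical numbers chosen only to make the arithmetic easy to follow.

```python
def standardized_risk(risk, p_c, e):
    # Standardization: sum over c of P[D = 1 | E = e, C = c] * P[C = c].
    return sum(risk[(e, c)] * p_c[c] for c in p_c)

def standardized_rr(risk, p_c):
    # Ratio of the standardized risks under e = 1 and e = 0.
    return standardized_risk(risk, p_c, 1) / standardized_risk(risk, p_c, 0)

# Hypothetical inputs: risk[(e, c)] = P[D = 1 | E = e, C = c], p_c[c] = P[C = c].
risk = {(1, 1): 0.4, (0, 1): 0.2, (1, 0): 0.2, (0, 0): 0.1}
p_c = {1: 0.5, 0: 0.5}

rr = standardized_rr(risk, p_c)
print(round(rr, 6))
```

Standardization reweights the observed stratum-specific risks. It does not change which individuals were actually exposed, which is the distinction from physical control drawn in the text.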

In the following sections, we demonstrate significant dif- 
ferences between these 4 hypothetical situations, by describ- 
ing theoretical data frequencies in terms of response types. 
Table 4 Enumeration of 48 EDS response types and corresponding potential outcomes

Column groups: E, D, and S type; potential outcomes of E, $E_c(\omega)$; potential outcomes of D, $D_{ce}(\omega)$;
compound potential outcomes of D, $D_{cE_{c'}}(\omega)$; potential outcomes of S, $S_{ed}(\omega)$; selection status,
$S_{E_c D_{cE_c}}(\omega)$, when C = 1 and when C = 0.

E    D    S    | E_1   E_0   | D_11  D_01  D_10  D_00  | D_1E1 D_1E0 D_0E1 D_0E0 | S_11  S_01  S_10  S_00  | C=1   C=0
1    1    1    | 1     1     | 1     1     (1)^a (1)   | 1     (1)   (1)   1     | 1     (1)   (1)   (1)   | 1     1
1    1    4    | 1     1     | 1     1     (1)   (1)   | 1     (1)   (1)   1     | 1     (1)   (0)   (0)   | 1     1
1    1    6    | 1     1     | 1     1     (1)   (1)   | 1     (1)   (1)   1     | 1     (0)   (1)   (0)   | 1     1
1    1    16   | 1     1     | 1     1     (1)   (1)   | 1     (1)   (1)   1     | 0     (0)   (0)   (0)   | 0     0
1    4    1    | 1     1     | 1     1     (0)   (0)   | 1     (1)   (1)   1     | 1     (1)   (1)   (1)   | 1     1
1    4    4    | 1     1     | 1     1     (0)   (0)   | 1     (1)   (1)   1     | 1     (1)   (0)   (0)   | 1     1
1    4    6    | 1     1     | 1     1     (0)   (0)   | 1     (1)   (1)   1     | 1     (0)   (1)   (0)   | 1     1
1    4    16   | 1     1     | 1     1     (0)   (0)   | 1     (1)   (1)   1     | 0     (0)   (0)   (0)   | 0     0
1    6    1    | 1     1     | 1     0     (1)   (0)   | 1     (1)   (0)   0     | 1     (1)   1     (1)   | 1     1
1    6    4    | 1     1     | 1     0     (1)   (0)   | 1     (1)   (0)   0     | 1     (1)   0     (0)   | 1     0
1    6    6    | 1     1     | 1     0     (1)   (0)   | 1     (1)   (0)   0     | 1     (0)   1     (0)   | 1     1
1    6    16   | 1     1     | 1     0     (1)   (0)   | 1     (1)   (0)   0     | 0     (0)   0     (0)   | 0     0
1    16   1    | 1     1     | 0     0     (0)   (0)   | 0     (0)   (0)   0     | 1     (1)   1     (1)   | 1     1
1    16   4    | 1     1     | 0     0     (0)   (0)   | 0     (0)   (0)   0     | 1     (1)   0     (0)   | 0     0
1    16   6    | 1     1     | 0     0     (0)   (0)   | 0     (0)   (0)   0     | 1     (0)   1     (0)   | 1     1
1    16   16   | 1     1     | 0     0     (0)   (0)   | 0     (0)   (0)   0     | 0     (0)   0     (0)   | 0     0
2    1    1    | 1     0     | 1     (1)   (1)   1     | 1     (1)   (1)   1     | 1     1     (1)   (1)   | 1     1
2    1    4    | 1     0     | 1     (1)   (1)   1     | 1     (1)   (1)   1     | 1     1     (0)   (0)   | 1     1
2    1    6    | 1     0     | 1     (1)   (1)   1     | 1     (1)   (1)   1     | 1     0     (1)   (0)   | 1     0
2    1    16   | 1     0     | 1     (1)   (1)   1     | 1     (1)   (1)   1     | 0     0     (0)   (0)   | 0     0
2    4    1    | 1     0     | 1     (1)   (0)   0     | 1     (0)   (1)   0     | 1     (1)   (1)   1     | 1     1
2    4    4    | 1     0     | 1     (1)   (0)   0     | 1     (0)   (1)   0     | 1     (1)   (0)   0     | 1     0
2    4    6    | 1     0     | 1     (1)   (0)   0     | 1     (0)   (1)   0     | 1     (0)   (1)   0     | 1     0
2    4    16   | 1     0     | 1     (1)   (0)   0     | 1     (0)   (1)   0     | 0     (0)   (0)   0     | 0     0
2    6    1    | 1     0     | 1     (0)   (1)   0     | 1     (1)   (0)   0     | 1     (1)   (1)   1     | 1     1
2    6    4    | 1     0     | 1     (0)   (1)   0     | 1     (1)   (0)   0     | 1     (1)   (0)   0     | 1     0
2    6    6    | 1     0     | 1     (0)   (1)   0     | 1     (1)   (0)   0     | 1     (0)   (1)   0     | 1     0
2    6    16   | 1     0     | 1     (0)   (1)   0     | 1     (1)   (0)   0     | 0     (0)   (0)   0     | 0     0
2    16   1    | 1     0     | 0     (0)   (0)   0     | 0     (0)   (0)   0     | (1)   (1)   1     1     | 1     1
2    16   4    | 1     0     | 0     (0)   (0)   0     | 0     (0)   (0)   0     | (1)   (1)   0     0     | 0     0
2    16   6    | 1     0     | 0     (0)   (0)   0     | 0     (0)   (0)   0     | (1)   (0)   1     0     | 1     0
2    16   16   | 1     0     | 0     (0)   (0)   0     | 0     (0)   (0)   0     | (0)   (0)   0     0     | 0     0
4    1    1    | 0     0     | (1)   (1)   1     1     | 1     (1)   (1)   1     | (1)   1     (1)   (1)   | 1     1
4    1    4    | 0     0     | (1)   (1)   1     1     | 1     (1)   (1)   1     | (1)   1     (0)   (0)   | 1     1
4    1    6    | 0     0     | (1)   (1)   1     1     | 1     (1)   (1)   1     | (1)   0     (1)   (0)   | 0     0
4    1    16   | 0     0     | (1)   (1)   1     1     | 1     (1)   (1)   1     | (0)   0     (0)   (0)   | 0     0
4    4    1    | 0     0     | (1)   (1)   0     0     | 0     (0)   (0)   0     | (1)   (1)   (1)   1     | 1     1
4    4    4    | 0     0     | (1)   (1)   0     0     | 0     (0)   (0)   0     | (1)   (1)   (0)   0     | 0     0
4    4    6    | 0     0     | (1)   (1)   0     0     | 0     (0)   (0)   0     | (1)   (0)   (1)   0     | 0     0
4    4    16   | 0     0     | (1)   (1)   0     0     | 0     (0)   (0)   0     | (0)   (0)   (0)   0     | 0     0
4    6    1    | 0     0     | (1)   (0)   1     0     | 1     (1)   (0)   0     | (1)   1     (1)   1     | 1     1
4    6    4    | 0     0     | (1)   (0)   1     0     | 1     (1)   (0)   0     | (1)   1     (0)   0     | 1     0
4    6    6    | 0     0     | (1)   (0)   1     0     | 1     (1)   (0)   0     | (1)   0     (1)   0     | 0     0
4    6    16   | 0     0     | (1)   (0)   1     0     | 1     (1)   (0)   0     | (0)   0     (0)   0     | 0     0
4    16   1    | 0     0     | (0)   (0)   0     0     | 0     (0)   (0)   0     | (1)   (1)   (1)   1     | 1     1
4    16   4    | 0     0     | (0)   (0)   0     0     | 0     (0)   (0)   0     | (1)   (1)   (0)   0     | 0     0
4    16   6    | 0     0     | (0)   (0)   0     0     | 0     (0)   (0)   0     | (1)   (0)   (1)   0     | 0     0
4    16   16   | 0     0     | (0)   (0)   0     0     | 0     (0)   (0)   0     | (0)   (0)   (0)   0     | 0     0

We consider 4 binary variables as follows: exposure E, outcome D, confounder C, and selection variable S (see Figure 1). We show the enumeration under the
assumptions of positive monotonicity of E, D, and S and no interaction at the individual level on the additive scale between C and E on D and E and D on S in the
counterfactual framework.

^a Parentheses indicate that this particular outcome will never be observed.

Suzuki et al. BMC Medical Research Methodology 2013, 13:101
http://www.biomedcentral.com/1471-2288/13/101

Results 

Describing data from observational studies in terms of 
response types 

As demonstrated above, under the situation described in 
Figure 1, individuals can be classified into one of the 
maximum of 1,024 EDS response types. Despite its sophis- 
tication and usefulness, however, the response type of each 
individual is unobservable. Indeed, this is called a funda- 
mental problem of causal inference [40]. Nonetheless, we 
can show the conceptual link between unobservable re- 
sponse types and observed, or observable, data frequencies 
in the population. In this respect, the concept of com- 
pound potential outcomes is quite useful. 



Figure 2 Four causal diagrams depicting hypothetical
situations. Panels A and B show observational studies (A: selected
individuals; B: total population), and panels C and D show randomized
controlled trials (C: selected individuals; D: total population).
E, D, C, and S denote exposure, disease, confounder, and
selection variable, respectively. C may act as a direct effect modifier
simultaneously. The square around S in Figure 2A and C indicates
that the analysis is restricted to those who do not drop out (i.e., S = 1).
By contrast, 2 diagrams in Figure 2B and D show the situations in
which information about total population is available to researchers.
See text for details.



In Figure 3, we describe theoretical data frequencies from
observational studies in terms of the 1,024 possible EDS response
types. We let EiDjSk denote the EDS response type
$[E^T = i, D^T = j, S^T = k]$ ($i = 1, \ldots, 4$; $j = 1, \ldots, 16$; $k = 1, \ldots, 16$),
and let $P_{EiDjSk}$ denote a prevalence of the individuals of
EiDjSk response type in the total population. We also let
$P_{C|EiDjSk}$ and $P_{\bar{C}|EiDjSk}$ denote probabilities of being exposed
and unexposed to C among the individuals of EiDjSk
response type, respectively. When no confusion occurs for
a dichotomous variable X, we use the notations $X$ and $\bar{X}$ in
the terminologies of events of X = 1 and X = 0, respectively.
For example, $C$ and $\bar{C}$ mean C = 1 and C = 0, respectively.
Further, N denotes the number of total population. Then,
data frequencies in each "cell" in Figure 3 can be described
either as $N \sum_{i,j,k} P_{C|EiDjSk} P_{EiDjSk}$ or $N \sum_{i,j,k} P_{\bar{C}|EiDjSk} P_{EiDjSk}$.
(Note that the former can be also expressed as $N P_C \sum_{i,j,k} P_{EiDjSk|C}$
and that the latter can be expressed as $N P_{\bar{C}} \sum_{i,j,k} P_{EiDjSk|\bar{C}}$,
where $P_C$ and $P_{\bar{C}}$ denote probabilities of $C$
and $\bar{C}$ in the total population, respectively.) It should be
noted that individuals can be classified into 16 "cells,"
which is equivalent to a maximum possible number
of 4 independent random events (i.e., E, D, C, and S).
The upper and lower parts of Figure 3 show data
frequencies among the subpopulation with C = 1 and
C = 0, respectively. Those who are classified into inner
dashed rectangles represent individuals selected for
analyses (i.e., S = 1) while those who are not classified
into the rectangles represent non-selected individuals
(i.e., S = 0). In other words, the information about the
individuals outside the rectangles is unavailable to
researchers.

Notably, individuals of the same EDS response types can
be potentially classified into 2 cells. For example, consider
individual $\omega$ who is classified as E1D6S4 response type
(see Table 4). This individual is, by definition, exposed to
E = 1 irrespective of the value of C (i.e., $E_1(\omega) = E_0(\omega) = 1$).
Further, individual $\omega$ is expected to experience outcome
D if there had been interventions to set C to 1 (i.e.,
$D_{1E_1}(\omega) = D_{11}(\omega) = 1$), whereas this individual is expected
not to experience outcome D if there had been interventions
to set C to 0 (i.e., $D_{0E_0}(\omega) = D_{01}(\omega) = 0$). Finally,
the information about this individual is, by definition,
available to researchers had there been interventions to set
C to 1 (i.e., $S_{E_1 D_{1E_1}}(\omega) = S_{11}(\omega) = 1$), whereas this individual
is lost to follow-up had there been interventions to set
C to 0 (i.e., $S_{E_0 D_{0E_0}}(\omega) = S_{10}(\omega) = 0$). Thus, in observational
studies, individual $\omega$ of E1D6S4 response type can
be classified into either one of the following 2 cells in
Figure 3; one is E = 1, D = 1, C = 1, and S = 1 while the
other is E = 1, D = 0, C = 0, and S = 0. Note that this depends
on the probabilities that C is present or absent in
individual $\omega$ (i.e., $P_{C|E1D6S4}$ and $P_{\bar{C}|E1D6S4}$).

Figure 3 Frequencies of individuals with 1,024 possible EDS response types in observational studies.
Each cell frequency is $N \sum_{i,j,k} P_{C|EiDjSk} P_{EiDjSk}$ for C = 1 and $N \sum_{i,j,k} P_{\bar{C}|EiDjSk} P_{EiDjSk}$ for C = 0,
summed over the following response types:

C  E  D  S  | i (E type) | j (D type)               | k (S type)
1  1  1  1  | 1,2        | 1,2,3,4,5,6,7,8          | 1,2,3,4,5,6,7,8
1  1  1  0  | 1,2        | 1,2,3,4,5,6,7,8          | 9,10,11,12,13,14,15,16
1  1  0  1  | 1,2        | 9,10,11,12,13,14,15,16   | 1,2,5,6,9,10,13,14
1  1  0  0  | 1,2        | 9,10,11,12,13,14,15,16   | 3,4,7,8,11,12,15,16
1  0  1  1  | 3,4        | 1,2,5,6,9,10,13,14       | 1,2,3,4,9,10,11,12
1  0  1  0  | 3,4        | 1,2,5,6,9,10,13,14       | 5,6,7,8,13,14,15,16
1  0  0  1  | 3,4        | 3,4,7,8,11,12,15,16      | 1,3,5,7,9,11,13,15
1  0  0  0  | 3,4        | 3,4,7,8,11,12,15,16      | 2,4,6,8,10,12,14,16
0  1  1  1  | 1,3        | 1,2,3,4,9,10,11,12       | 1,2,3,4,5,6,7,8
0  1  1  0  | 1,3        | 1,2,3,4,9,10,11,12       | 9,10,11,12,13,14,15,16
0  1  0  1  | 1,3        | 5,6,7,8,13,14,15,16      | 1,2,5,6,9,10,13,14
0  1  0  0  | 1,3        | 5,6,7,8,13,14,15,16      | 3,4,7,8,11,12,15,16
0  0  1  1  | 2,4        | 1,3,5,7,9,11,13,15       | 1,2,3,4,9,10,11,12
0  0  1  0  | 2,4        | 1,3,5,7,9,11,13,15       | 5,6,7,8,13,14,15,16
0  0  0  1  | 2,4        | 2,4,6,8,10,12,14,16      | 1,3,5,7,9,11,13,15
0  0  0  0  | 2,4        | 2,4,6,8,10,12,14,16      | 2,4,6,8,10,12,14,16

We consider 4 binary variables as follows: exposure E, outcome D, confounder C, and selection variable S (see Figure 1).
As shown in Tables 1, 2 and 3, we consider 4 response types for E, 16 response types for D, and 16 response types for S.
We let EiDjSk denote the EDS response type of $[E^T = i, D^T = j, S^T = k]$, and let $P_{EiDjSk}$ denote a prevalence of the
individuals of EiDjSk response type in the total population. We also let $P_{C|EiDjSk}$ and $P_{\bar{C}|EiDjSk}$ denote probabilities
of being exposed and unexposed to C among the individuals of EiDjSk response type, respectively. Further, N denotes the
number of total population. Those who are classified into inner dashed rectangles represent individuals selected for
analyses (i.e., S = 1) while those who are not classified into the rectangles represent non-selected individuals
(i.e., S = 0). See text for details.
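The mapping from an EDS response type and an observed value of C to a cell of Figure 3 can be sketched as follows. This is an illustrative reading of the compound potential outcomes, assuming the type numbering of Tables 1 to 3; the helper names `lookup`, `observed_cell`, and `types_in_cell` are hypothetical.

```python
from itertools import product

# Assumption: numbering follows Tables 1-3.
E_TYPES = {i + 1: b for i, b in enumerate(product((1, 0), repeat=2))}  # (E_1, E_0)
D_TYPES = {j + 1: b for j, b in enumerate(product((1, 0), repeat=4))}  # (D_11, D_01, D_10, D_00)
S_TYPES = {k + 1: b for k, b in enumerate(product((1, 0), repeat=4))}  # (S_11, S_01, S_10, S_00)

POSITION = {(1, 1): 0, (0, 1): 1, (1, 0): 2, (0, 0): 3}

def lookup(bits4, first, second):
    # Return X_{first,second} from a tuple ordered (X_11, X_01, X_10, X_00).
    return bits4[POSITION[(first, second)]]

def observed_cell(i, j, k, c):
    # Observed (E, D, S) for an individual of type EiDjSk whose confounder equals c.
    e1, e0 = E_TYPES[i]
    e = e1 if c == 1 else e0       # E_c
    d = lookup(D_TYPES[j], c, e)   # D_{c E_c}
    s = lookup(S_TYPES[k], e, d)   # S_{E_c D_{c E_c}}
    return e, d, s

def types_in_cell(c, e, d, s):
    # All (i, j, k) that fall into the cell (C = c, E = e, D = d, S = s).
    return [(i, j, k) for i, j, k in product(E_TYPES, D_TYPES, S_TYPES)
            if observed_cell(i, j, k, c) == (e, d, s)]

print(observed_cell(1, 6, 4, c=1), observed_cell(1, 6, 4, c=0))
```

Calling `types_in_cell` for a given cell should return the index sets i, j, and k listed for that cell in Figure 3.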

To summarize, Figure 3 shows theoretical data frequencies
in an observational study (i.e., Figure 2A and
B). The situation is, however, strikingly different when 
we conduct a randomized controlled trial, which will be 
demonstrated in the next section. 

Describing data from randomized controlled trials in 
terms of response types 

As noted above, researchers manipulate the value of E in
randomized controlled trials. Since researchers physically
prevent E from varying in response to variations in C, we
do not need to consider E response types when describing
theoretical data frequencies in ideal randomized controlled
trials; rather we focus on D response types and S response
types. In other words, observed exposure status and E response
types become independent (i.e., $E \perp\!\!\!\perp E^T$) when
researchers marginally intervene on E. Thus, theoretical data
frequencies from randomized controlled trials can be described
in terms of 256 (i.e., $16 \times 16$) possible DS response
types, in contrast with 1,024 possible EDS response types.

We let $P_E$ and $P_{\bar{E}}$ denote the probabilities of $E$ and $\bar{E}$ in
the total population, respectively. (For simplicity, we describe
the situation of marginal randomization of E. However,
the following discussion can be extended to the
situation of stratified randomization, in which $P_E$ and $P_{\bar{E}}$
may vary across the strata of C.) Figure 4 shows distributions
of individuals of the 256 possible DS response types in
a randomized controlled trial. Note that data frequencies in
each "cell" in Figure 4 can be described as
$N P_E \sum_{j,k} P_{C|DjSk} P_{DjSk}$, $N P_{\bar{E}} \sum_{j,k} P_{C|DjSk} P_{DjSk}$,
$N P_E \sum_{j,k} P_{\bar{C}|DjSk} P_{DjSk}$, or $N P_{\bar{E}} \sum_{j,k} P_{\bar{C}|DjSk} P_{DjSk}$.
(Note that these can be also expressed
as $N P_E P_C \sum_{j,k} P_{DjSk|C}$, $N P_{\bar{E}} P_C \sum_{j,k} P_{DjSk|C}$,
$N P_E P_{\bar{C}} \sum_{j,k} P_{DjSk|\bar{C}}$, and $N P_{\bar{E}} P_{\bar{C}} \sum_{j,k} P_{DjSk|\bar{C}}$, respectively.) As in the
case of observational studies (Figure 3), individuals can be
classified into 16 "cells" in Figure 4.

The theoretical data frequencies in Figure 4 can be
explained as a re-distribution of individuals in Figure 3.
For example, consider individual ω who is classified as
either E1D6S4 response type or E2D6S4 response type. In
observational studies, if the value of C of individual ω is 1,
this individual is classified into an upper-left cell within
the inner dashed rectangle of the upper part of Figure 3,
i.e., E = 1, D = 1, C = 1, and S = 1. Then, when this individual
is forced to be exposed to E in a randomized controlled
trial, this individual would remain in the upper-left cell
within the inner dashed rectangle of the upper part of
Figure 4. Note that neither D response types nor S response
types of this individual change by the intervention
on E. By contrast, if individual ω is forced to be unexposed
to E, this individual "moves" to a lower-left cell within the
inner dashed rectangle of the upper part of Figure 4, i.e.,
E = 0, D = 1, C = 1, and S = 1. On the other hand, consider
individual ω who is classified as either E3D6S4 response
type or E4D6S4 response type. In observational studies, if
the value of C of individual ω is 1, this individual is
classified into a lower-left cell within the inner dashed rectangle
of the upper part of Figure 3, i.e., E = 0, D = 1, C = 1, and
S = 1. Then, in randomized controlled trials, if this individual
is forced to be unexposed to E, this individual would
remain in the lower-left cell within the inner dashed
rectangle of the upper part of Figure 4. Meanwhile, if this
individual is forced to be exposed to E, this individual "moves"
to an upper-left cell within the inner dashed rectangle of
the upper part of Figure 4, i.e., E = 1, D = 1, C = 1, and S = 1.
These re-distributions can be summarized as

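$$
\begin{aligned}
&P_E \left( N \sum_{i=1,2} P_{C|EiD6S4} P_{EiD6S4} + N \sum_{i=3,4} P_{C|EiD6S4} P_{EiD6S4} \right) \\
&\quad = N P_E \sum_{i=1,2,3,4} P_C P_{EiD6S4|C} \\
&\quad = N P_E P_C P_{D6S4|C} \\
&\quad = N P_E P_{C|D6S4} P_{D6S4}
\end{aligned}
\tag{3}
$$

and

$$
\begin{aligned}
&P_{\bar{E}} \left( N \sum_{i=1,2} P_{C|EiD6S4} P_{EiD6S4} + N \sum_{i=3,4} P_{C|EiD6S4} P_{EiD6S4} \right) \\
&\quad = N P_{\bar{E}} \sum_{i=1,2,3,4} P_C P_{EiD6S4|C} \\
&\quad = N P_{\bar{E}} P_C P_{D6S4|C} \\
&\quad = N P_{\bar{E}} P_{C|D6S4} P_{D6S4}.
\end{aligned}
\tag{4}
$$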

Note that the numbers in the parentheses of the left-hand
sides of equations 3 and 4 are based on the subpopulation
of C = 1 in observational studies (i.e., upper part of Figure 3),
whereas the right-hand sides of these equations are based
on the subpopulation of C = 1 in randomized controlled
trials (i.e., upper part of Figure 4). In other words, these
equations explain how individuals of the subpopulation of
C = 1 are re-distributed as a result of intervention on E.

[Figure 4: diagram of 16 cells. The upper part corresponds to C = 1 and
the lower part to C = 0. Within each part, rows are E = 1 and E = 0 and
columns are D = 1 and D = 0; cells inside the inner dashed rectangle
correspond to S = 1 and cells outside it to S = 0. Each cell gives a
frequency of the form $N P_E \sum_{j,k} P_{C|DjSk} P_{DjSk}$ (with $P_{\bar{E}}$
for E = 0 and $P_{\bar{C}|DjSk}$ for C = 0), summed over the D response
types j and S response types k that fall in the cell.]

Figure 4 Frequencies of individuals with 256 possible DS response types in randomized controlled trials. We consider 4 binary variables
as follows: exposure E, outcome D, confounder C, and selection variable S (see Figure 1). As shown in Tables 2 and 3, we consider 16 response
types for D and 16 response types for S. We let $P_E$ and $P_{\bar{E}}$ denote the probabilities of E and $\bar{E}$ in the total population, respectively. For the
definition of other notations, see Figure 3. Those who are classified into inner dashed rectangles represent individuals selected for analyses
(i.e., S = 1) while those who are not classified into the rectangles represent non-selected individuals (i.e., S = 0). See text for details.

Analogous discussion applies when the value of C is 0
among the individuals of E1D6S4, E2D6S4, E3D6S4, or
E4D6S4 response types. Note that, in observational studies,
these individuals are classified in either an upper-right
cell (i.e., E = 1, D = 0, C = 0, and S = 0) or a lower-right cell
(i.e., E = 0, D = 0, C = 0, and S = 0) outside the inner dashed
rectangle of the lower part of Figure 3. The re-distributions
of these individuals as a result of intervention
on E can be summarized as

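$$
\begin{aligned}
&P_E \left( N \sum_{i=1,3} P_{\bar{C}|EiD6S4} P_{EiD6S4} + N \sum_{i=2,4} P_{\bar{C}|EiD6S4} P_{EiD6S4} \right) \\
&\quad = N P_E \sum_{i=1,2,3,4} P_{\bar{C}} P_{EiD6S4|\bar{C}} \\
&\quad = N P_E P_{\bar{C}} P_{D6S4|\bar{C}} \\
&\quad = N P_E P_{\bar{C}|D6S4} P_{D6S4},
\end{aligned}
\tag{5}
$$

and

$$
\begin{aligned}
&P_{\bar{E}} \left( N \sum_{i=1,3} P_{\bar{C}|EiD6S4} P_{EiD6S4} + N \sum_{i=2,4} P_{\bar{C}|EiD6S4} P_{EiD6S4} \right) \\
&\quad = N P_{\bar{E}} \sum_{i=1,2,3,4} P_{\bar{C}} P_{EiD6S4|\bar{C}} \\
&\quad = N P_{\bar{E}} P_{\bar{C}} P_{D6S4|\bar{C}} \\
&\quad = N P_{\bar{E}} P_{\bar{C}|D6S4} P_{D6S4}.
\end{aligned}
\tag{6}
$$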

Again, the numbers in the parentheses of the left-hand sides
of equations 5 and 6 are based on the subpopulation of
C = 0 in observational studies (i.e., lower part of Figure 3),
whereas the right-hand sides of these equations are based
on the subpopulation of C = 0 in randomized controlled
trials (i.e., lower part of Figure 4). In other words, these
equations explain how individuals of the subpopulation of
C = 0 are re-distributed as a result of intervention on E. It
should be noted that these re-distributions do not occur
across the upper and lower parts of Figures 3 and 4
because C precedes E temporally and the value of C is, by
definition, predetermined before intervention on E. These
discussions also demonstrate that, in Figure 4, individuals
of the same DS response types can be potentially classified
into 4 cells, depending on the probability of being exposed
or unexposed to C (i.e., $P_{C|DjSk}$ or $P_{\bar{C}|DjSk}$) and the
probability of being exposed or unexposed to E (i.e., $P_E$ or $P_{\bar{E}}$).
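
The re-distribution identities in equations 3 and 5 can be checked numerically. The sketch below assumes an arbitrary, randomly generated joint distribution of C and the four E response types among individuals of D6S4 response type; the population size, the randomization probability, and the 10% share of D6S4 are hypothetical values chosen only for illustration. The grouping of E response types (E1 and E2 exposed when C = 1; E1 and E3 exposed when C = 0) follows the examples in the text.

```python
import random

random.seed(0)
N = 10_000   # hypothetical population size
P_E = 0.5    # hypothetical probability of assignment to exposure
E_TYPES = (1, 2, 3, 4)

# Hypothetical joint probabilities P(C = c, Ei, D6S4); the D6S4 response type
# is assumed to make up 10% of the total population.
raw = {(c, i): random.random() for c in (0, 1) for i in E_TYPES}
scale = 0.10 / sum(raw.values())
joint = {key: value * scale for key, value in raw.items()}

p_type = {i: joint[(0, i)] + joint[(1, i)] for i in E_TYPES}      # P(EiD6S4)
p_c_given_type = {i: joint[(1, i)] / p_type[i] for i in E_TYPES}  # P(C | EiD6S4)
p_d6s4 = sum(p_type.values())                                     # P(D6S4)
p_c_given_d6s4 = sum(joint[(1, i)] for i in E_TYPES) / p_d6s4     # P(C | D6S4)


def lhs(groups, p_c_given):
    """Left-hand side: P_E times the sum of the observational-study cells."""
    return P_E * sum(N * sum(p_c_given[i] * p_type[i] for i in g) for g in groups)


# Equation 3 (stratum C = 1): types E1, E2 are exposed; E3, E4 are unexposed.
eq3_lhs = lhs([(1, 2), (3, 4)], p_c_given_type)
eq3_rhs = N * P_E * p_c_given_d6s4 * p_d6s4

# Equation 5 (stratum C = 0): types E1, E3 are exposed; E2, E4 are unexposed.
p_notc_given_type = {i: 1 - p_c_given_type[i] for i in E_TYPES}
eq5_lhs = lhs([(1, 3), (2, 4)], p_notc_given_type)
eq5_rhs = N * P_E * (1 - p_c_given_d6s4) * p_d6s4

print(abs(eq3_lhs - eq3_rhs) < 1e-9, abs(eq5_lhs - eq5_rhs) < 1e-9)
```

Both comparisons hold for any joint distribution, because summing over the E response types is all that the intervention on E does to these cells.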

Note that, when the information about the total population
is available, both marginal and conditional exchangeability
assumptions are met in Figure 4; the distributions
of DS response types are comparable between the exposed
and unexposed groups. However, when the information
about those who dropped out is not available, exchangeability
assumptions do not hold, either conditionally or
unconditionally. See Additional file 1: Appendix 1 for a
discussion of positivity, another fundamental assumption
for causal inference [41-43].
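
The exchangeability statement above can be illustrated with an exact calculation. The sketch below assumes a randomly generated joint distribution of C, D response types, and S response types, and a hypothetical tuple encoding of response types that does not follow the numbering of Tables 2 and 3. It compares the distribution of D response types between the two arms of a marginally randomized trial, first in the total population and then among those selected (S = 1).

```python
import random
from itertools import product

random.seed(1)
TYPES = list(product((0, 1), repeat=4))  # 16 response types per variable

# Hypothetical encoding (not the numbering of Tables 2 and 3):
#   d_type[2 * c + e] is the value of D when C = c and E = e
#   s_type[2 * e + d] is the value of S when E = e and D = d
weights = {(c, dt, st): random.random()
           for c in (0, 1) for dt in TYPES for st in TYPES}
total = sum(weights.values())
population = {key: w / total for key, w in weights.items()}  # P(C, D type, S type)


def d_type_distribution(e, selected_only):
    """P(D response type | E = e), optionally restricted to S = 1."""
    mass = dict.fromkeys(TYPES, 0.0)
    for (c, dt, st), p in population.items():
        d = dt[2 * c + e]
        s = st[2 * e + d]
        if selected_only and s == 0:
            continue
        mass[dt] += p  # P(E = e) is a common factor and cancels below
    z = sum(mass.values())
    return {dt: m / z for dt, m in mass.items()}


def max_gap(selected_only):
    """Largest difference in type probabilities between the two arms."""
    exposed = d_type_distribution(1, selected_only)
    unexposed = d_type_distribution(0, selected_only)
    return max(abs(exposed[dt] - unexposed[dt]) for dt in TYPES)


print("total population:", max_gap(False))
print("selected (S = 1):", max_gap(True))
```

In the total population the two arms have identical distributions of D response types, whereas restricting to S = 1 generally makes them differ, because selection depends on E and D.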

Epidemiologic measures in terms of response types 

The descriptions of data frequencies in Figures 3 and 4
have a crucial implication, demonstrating significant
distinctions between association measures and effect measures
[9]. In the following sections, we continue to focus our
discussion on RRs, which can be extended to other measures.
Note also that, although epidemiologic measures can
be defined for a variety of target populations (e.g., the
exposed and the unexposed), the following discussion focuses
on the situation in which the target population is the total
population. Furthermore, we also discuss epidemiologic measures
in the subpopulation defined by C or S.

In observational studies (Figure 2A and B), researchers
can readily calculate associational RRs by referring to the
notations in Figure 3. In particular, when no information is
available about those who dropped out (Figure 2A), one can
calculate an associational $RR_{S=1}$ by using the information
about individuals classified into the inner rectangles in Figure 3.
Then, as shown in Additional file 2: Table S1, associational
$RR_{S=1}$ can be described in terms of a probability
of being exposed or unexposed to C among the individuals
of EiDjSk response type (i.e., $P_{C|EiDjSk}$ or $P_{\bar{C}|EiDjSk}$)
and a prevalence of the individuals of EiDjSk response
type in the total population (i.e., $P_{EiDjSk}$) (equation A1).
Meanwhile, when researchers are capable of gathering
information about those who dropped out (Figure 2B),
the information about individuals of S response types 1
through 16 is available, which yields an associational RR
(equation A4).

By contrast, when researchers obtain data from randomized
controlled trials (Figure 2C and D), their frequencies
can be described in a different way, as shown in Figure 4.
In these cases, researchers can calculate causal RRs to
infer causality between E and D. When no information is
available about those who dropped out (Figure 2C), one
can calculate a causal $RR_{S=1}$ by using the information
about individuals classified into the inner rectangles in
Figure 4. Then, as shown in Additional file 2: Table S2,
causal $RR_{S=1}$ can be described in terms of a probability of
being exposed or unexposed to C among the individuals
of DjSk response type (i.e., $P_{C|DjSk}$ or $P_{\bar{C}|DjSk}$) and a
prevalence of the individuals of DjSk response type in the total
population (i.e., $P_{DjSk}$) (equation A7). In ideal randomized
controlled trials without loss to follow-up (Figure 2D), the
information about individuals of S response types 1
through 16 is available, which yields a causal RR
(equation A10). We should note that the causal RR shown
in equation A10 is an alternative notation of the causal
RR shown in equation 1 in terms of response types
(see Additional file 1: Appendix 2).

Note that, even in ideal (either marginal or stratified)
randomized controlled trials, one may observe a heterogeneity
between stratum-specific causal RRs, which will
be addressed in the section entitled "Modification of
epidemiologic measures".

Confounding bias 

In this section, we aim to further clarify the concept of 
confounding bias in the counterfactual framework, by 
describing it in terms of response types. 

We show a sufficient condition to estimate effect mea- 
sures in observational studies by adjusting for confounding 
bias in terms of response types. In this case, we use effect 
measures in the total population in ideal randomized con- 
trolled trials (i.e., causal RR) as a gold standard. As noted 
above, confounding bias is induced by a common cause C 
of E and D. Thus, to show a sufficient condition to adjust 
for confounding bias, we need to compare association mea- 
sures in the total population in observational studies 
(Figure 2B) and effect measures in the total population in 
randomized controlled trials (Figure 2D). In other words, a
sufficient condition to adjust for confounding bias can be 
described as: adjusted associational RR = causal RR. Note 
that we here compare 2 distinct types of epidemiologic 
measures, which are obtained from distinct study designs. 

To adjust for confounding bias in observational studies, 
one may calculate a weighted average of stratum-specific 
associational RRs, or standardization, expecting to esti- 
mate a causal RR. By using stratum-specific associational 
RRs (equations A5 and A6), this procedure can be de- 
scribed in terms of response types as follows: 



Notably, this is not equivalent to a causal RR (equation
A10). In other words, this stratification-based procedure
does not "delete" the arrow from C to E in Figure 2B,
yielding subtly different measures from effect measures.
When one can assume conditional exchangeability (i.e.,
$E \amalg D_e \mid C$ for $\forall e$), the weighted average of stratum-specific
associational RRs can be expressed as

$$
\frac{\sum_{j=1,2,3,4,5,6,7,8} P_{C|Dj} P_{Dj} + \sum_{j=1,2,3,4,9,10,11,12} P_{\bar{C}|Dj} P_{Dj}}
     {\sum_{j=1,2,5,6,9,10,13,14} P_{C|Dj} P_{Dj} + \sum_{j=1,3,5,7,9,11,13,15} P_{\bar{C}|Dj} P_{Dj}},
\tag{8}
$$

which is equivalent to a causal RR in equation A10 (see
Additional file 1: Appendix 3). Indeed, the condition
$E \amalg D_e \mid C$ for $\forall e$ is a sufficient condition to estimate effect
measures in observational studies by adjusting for
confounding bias, and the assumption of exchangeability
often gets most of the attention in discussions about
causal inference [44]. Unfortunately, however, the condition
is not guaranteed in observational studies, and expert
knowledge is required. Importantly, the assumption of
conditional exchangeability, i.e., $E \amalg D_e \mid C$ for $\forall e$, is subtly
weaker than another sufficient condition to estimate effect
measures in observational studies, i.e., full conditional
exchangeability, or $E \amalg (D_{e=1}, D_{e=0}) \mid C$ [45]. It may be
difficult, however, to imagine a practical scenario where the
former holds but not the latter [46], and the word
"exchangeability" has been sometimes used interchangeably
in the literature. (A combination of full exchangeability
and positivity has been termed "strongly ignorable treatment
assignment" assumption or "strong ignorability,"
whereas a combination of exchangeability and positivity
has been termed "weakly ignorable treatment assignment"
assumption or "weak ignorability" [2,47,48].) By comparing
equations 7 and 8, we can show that the conditions
$E^{RT} \amalg D^{RT} \mid C$ and $E \amalg D^{RT} \mid C$ are also sufficient conditions to
estimate effect measures in observational studies by
adjusting for confounding bias (see Additional file 1:
Appendix 3). In Additional file 1: Appendix 4, we show a
proof of the following inclusion relation:

$$
E^{RT} \amalg D^{RT} \mid C \;\Rightarrow\; E \amalg D^{RT} \mid C \;\Rightarrow\; E \amalg (D_{e=1}, D_{e=0}) \mid C \;\Rightarrow\; E \amalg D_e \mid C \ \text{for}\ \forall e
$$

The subtle differences between $E^{RT} \amalg D^{RT} \mid C$ and $E \amalg D^{RT} \mid C$
are described graphically in the section entitled "Extended
causal diagrams integrating response types". It is worthwhile
to mention that the condition $E^{RT} \amalg D^{RT} \mid C$ is not
guaranteed in randomized controlled trials.
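
The role of these conditions can be illustrated with an exact calculation on two hypothetical populations. The sketch below is not the paper's derivation: it assumes a tuple encoding of response types that does not follow the paper's tables, takes the causal RR as the ratio of the risks had everyone been exposed versus unexposed, and takes the adjusted associational RR as the ratio of risks standardized to the distribution of C. In population (a), E response types and D response types are independent within strata of C; in population (b), their joint distribution is arbitrary.

```python
import random
from itertools import product

random.seed(2)
E_TYPES = list(product((0, 1), repeat=2))  # et[c]: value of E when C = c
D_TYPES = list(product((0, 1), repeat=4))  # dt[2 * c + e]: value of D when C = c, E = e


def normalise(w):
    z = sum(w.values())
    return {k: v / z for k, v in w.items()}


def causal_rr(pop):
    """Risk had everyone been exposed divided by risk had no one been exposed."""
    risk1 = sum(p * dt[2 * c + 1] for (c, et, dt), p in pop.items())
    risk0 = sum(p * dt[2 * c + 0] for (c, et, dt), p in pop.items())
    return risk1 / risk0


def standardised_rr(pop):
    """Associational RR standardized to the total-population distribution of C."""
    def std_risk(e):
        total = 0.0
        for c in (0, 1):
            stratum = [(et, dt, p) for (cc, et, dt), p in pop.items() if cc == c]
            p_c = sum(p for _, _, p in stratum)
            p_e = sum(p for et, _, p in stratum if et[c] == e)
            p_de = sum(p * dt[2 * c + e] for et, dt, p in stratum if et[c] == e)
            total += p_c * p_de / p_e  # P(C = c) * P(D = 1 | E = e, C = c)
        return total
    return std_risk(1) / std_risk(0)


# (a) E response types and D response types independent within strata of C
p_c = {0: 0.6, 1: 0.4}
p_et = {c: normalise({et: random.random() for et in E_TYPES}) for c in (0, 1)}
p_dt = {c: normalise({dt: random.random() for dt in D_TYPES}) for c in (0, 1)}
independent = {(c, et, dt): p_c[c] * p_et[c][et] * p_dt[c][dt]
               for c in (0, 1) for et in E_TYPES for dt in D_TYPES}

# (b) arbitrary dependence between E response types and D response types
dependent = normalise({(c, et, dt): random.random()
                       for c in (0, 1) for et in E_TYPES for dt in D_TYPES})

for name, pop in (("independent", independent), ("dependent", dependent)):
    print(name, causal_rr(pop), standardised_rr(pop))
```

In population (a) the two measures coincide, while in population (b) they generally differ, which is why the condition has to be argued from knowledge external to the data.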

The above discussion implies that analytic adjustment
for C in observational studies has consequences quite
different from those of physical control in randomized
controlled trials. Even when adequate analytic control of
C may be envisaged in observational studies, researchers
cannot estimate effect measures without assumptions
external to the data. See Additional file 1: Appendix 5 for a
discussion of recently introduced assumptions to
compensate for a lack of randomization.

Selection bias 

In this section, we aim to further clarify the concept of 
selection bias in the counterfactual framework, by de- 
scribing it in terms of response types. 

We show sufficient conditions for non-selection bias in
terms of response types. As explained above, selection bias
is induced by conditioning on a common effect of E and D
(Figure 2A and C). Thus, to show sufficient conditions for
non-selection bias, we need to specify epidemiologic measures,
i.e., association measures or effect measures. With
regard to association measures, a sufficient condition for
non-selection bias is described as associational $RR_{S=1}$ =
associational RR (see equations A1 and A4). Likewise, a
sufficient condition for non-selection bias for effect measures
is described as causal $RR_{S=1}$ = causal RR (see equations A7
and A10). It is worthwhile to mention that, when discussing
selection bias, one needs to specify a stratum of S [21]. In
most cases, researchers are interested in the presence and
the degree of selection bias among the subjects who do not
drop out. Thus, we here show sufficient conditions for
non-selection bias in a stratum S = 1. As explained later by
using extended causal diagrams, selection bias results in
violation of $E \amalg D^{RT}$ even when exposure is randomly
assigned.
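
The comparison of causal $RR_{S=1}$ with causal RR can be sketched for a marginally randomized trial as follows. The two populations, the tuple encoding of response types, and the 70%/30% split are hypothetical choices made for illustration. In population (a), every individual is either always selected or never selected, independently of C and of D response types; in population (b), the joint distribution of C, D response types, and S response types is arbitrary.

```python
import random
from itertools import product

random.seed(3)
TYPES = list(product((0, 1), repeat=4))
ALWAYS, NEVER = (1, 1, 1, 1), (0, 0, 0, 0)  # S types that ignore E and D

# Hypothetical encoding (not the numbering of Tables 2 and 3):
#   dt[2 * c + e] is the value of D when C = c and E = e
#   st[2 * e + d] is the value of S when E = e and D = d


def normalise(w):
    z = sum(w.values())
    return {k: v / z for k, v in w.items()}


def causal_rr(pop, selected_only):
    """Risk ratio between the randomized arms, optionally among S = 1 only."""
    def risk(e):
        cases = at_risk = 0.0
        for (c, dt, st), p in pop.items():
            d = dt[2 * c + e]
            s = st[2 * e + d]
            if selected_only and s == 0:
                continue
            at_risk += p
            cases += p * d
        return cases / at_risk
    return risk(1) / risk(0)


# (a) selection unrelated to E, D, C, and D response types
p_cd = normalise({(c, dt): random.random() for c in (0, 1) for dt in TYPES})
no_bias = {(c, dt, st): p * q
           for (c, dt), p in p_cd.items()
           for st, q in ((ALWAYS, 0.7), (NEVER, 0.3))}

# (b) arbitrary joint distribution of C, D types, and S types
generic = normalise({(c, dt, st): random.random()
                     for c in (0, 1) for dt in TYPES for st in TYPES})

for name, pop in (("no_bias", no_bias), ("generic", generic)):
    print(name, causal_rr(pop, False), causal_rr(pop, True))
```

Population (a) is only one simple way of satisfying causal $RR_{S=1}$ = causal RR; it is not claimed to be the only one.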



Modification of epidemiologic measures 

For decades, epidemiologists have used the term "effect
modification" in a broad context, simply referring to
a variation in the selected effect measure for the
factor under study across levels of another factor [49].
In this respect, a recent paper clarified the distinction
between interaction and effect modification within the
counterfactual framework [31]. It is also well known that
the presence, direction, and size of modification can be
dependent on the choice of measure [50]. Since the term
"effect modification" is ambiguous, it is now recommended
to specify the measures more precisely, e.g., risk-difference
modification [50]. The above discussion implies that
researchers need to distinguish association-measure
modification and effect-measure modification. For example,
when the information about the total population is available in
a randomized controlled trial, causal-RR modification is
defined to be present if stratum-specific causal RRs from
each subpopulation vary across the strata of C, i.e., causal
$RR_{C=1} \neq$ causal $RR_{C=0}$ (see equations A11 and A12).
When stratum-specific causal RRs are (approximately)
homogeneous or uniform across strata, researchers usually
pool the data to calculate a causal RR in the total population
(i.e., causal RR). In a similar manner, one can define
associational-RR modification (see equations A5 and A6).
Only if it is appropriate to pool the data across the strata
of C can one validly interpret associational RRs in the
total population.

Notably, the presence of association-measure modifi- 
cation does not necessarily imply the presence of effect- 
measure modification, and vice versa. 
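
The dependence of modification on the choice of measure can be seen in a small arithmetic example. The stratum-specific risks below are hypothetical values chosen for illustration; they are not taken from the paper.

```python
# Hypothetical stratum-specific risks of D (illustrative values only)
risks = {
    1: {"exposed": 0.40, "unexposed": 0.20},  # stratum C = 1
    0: {"exposed": 0.20, "unexposed": 0.10},  # stratum C = 0
}

risk_ratio = {c: r["exposed"] / r["unexposed"] for c, r in risks.items()}
risk_difference = {c: r["exposed"] - r["unexposed"] for c, r in risks.items()}

rr_modified = abs(risk_ratio[1] - risk_ratio[0]) > 1e-9
rd_modified = abs(risk_difference[1] - risk_difference[0]) > 1e-9

print("RR modification:", rr_modified)  # False: the RR is 2 in both strata
print("RD modification:", rd_modified)  # True: the RD is 0.2 versus 0.1
```

The same risks therefore show no risk-ratio modification but clear risk-difference modification, which is why the measure should be named explicitly.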

Extended causal diagrams integrating response types 

In this section, we attempt to explain the concept of bias
by extending causal diagrams, which integrate response
types and observed variables. Although these causal diagrams,
or extended DAGs, may appear less intuitive,
they maintain the integrity of the original DAGs and
would be of great use in graphically describing the findings
discussed in this study. In particular, by integrating
response types and observed variables, we can readily
understand subtle differences between $E^{RT} \amalg D^{RT} \mid C$ and
$E \amalg D^{RT} \mid C$, demonstrating sufficient conditions to
estimate effect measures in observational studies.

Figure 5 An extended causal diagram depicting a hypothetical example.

Figure 5 shows the hypothetical situation described in
Figure 1 by integrating response types of E, D, and S
(i.e., $E^{RT}$, $D^{RT}$, and $S^{RT}$, respectively). First, note that the
only arrows emanating from the response types point to
the corresponding observed variables, i.e., $E^{RT} \to E$,
$D^{RT} \to D$, and $S^{RT} \to S$. Then, to describe the underlying
correlation between $E^{RT}$, $D^{RT}$, and $S^{RT}$, we use a total of 3
unmeasured common causes, U1, U2, and U3, which are
independent of each other. In other words, these
unmeasured common causes as a whole represent the
underlying personal characteristics determining each
individual's response types of E, D, and S. (Note that, unlike U1, U2,
and U3, we assume that C does not determine an individual's
response types. In other words, we assume that U1, U2,
and U3 precede response types, while C does not.) It
is worth mentioning that Figure 5 describes
how the observed variables are determined in response
to the corresponding response types and their measured
parent(s). For example, each individual has 2 potential
outcomes of E: the outcome that would occur if C is
present in that individual, $E_1$, and the outcome that
would occur if C is absent in that individual, $E_0$. Thus,
we observe $E = C E_1 + (1 - C) E_0$. In Figure 5, this equation
is illustrated by the only 2 arrows from $E^{RT}$ and C to E.
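
The relation between the observed exposure, the E response type, and C can be written out for the four E response types. In the sketch below, the function name `observed_exposure` is hypothetical, and the numbering of the E response types is an assumption inferred from the re-distribution examples in the text (E1 and E2 are exposed when C = 1; E1 and E3 are exposed when C = 0).

```python
def observed_exposure(c: int, e1: int, e0: int) -> int:
    """Observed E from the potential exposures: E = C * E_1 + (1 - C) * E_0."""
    return c * e1 + (1 - c) * e0


# Assumed numbering of E response types as (E_1, E_0) pairs, inferred from the text
E_RESPONSE_TYPES = {
    "E1": (1, 1),  # exposed regardless of C
    "E2": (1, 0),  # exposed only if C is present
    "E3": (0, 1),  # exposed only if C is absent
    "E4": (0, 0),  # unexposed regardless of C
}

for name, (e1, e0) in E_RESPONSE_TYPES.items():
    row = {c: observed_exposure(c, e1, e0) for c in (1, 0)}
    print(name, "C=1 ->", row[1], "| C=0 ->", row[0])
```

The observed exposure is thus fully determined by the pair of inputs that have arrows into E in the extended diagram: the response type and the value of C.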

In randomized controlled trials, in which E is marginally
randomized, researchers physically prevent E from
varying in response to variations in C by intervening on
E. Thus, by applying the rule used in the standard DAG
theory, the 2 arrows pointing to E in Figure 5 are
removed. As a result, observed exposure status and E
response types become independent (i.e., $E \amalg E^{RT}$), as
shown in Figure 6. In addition, observed exposure status
becomes independent of D response types and S
response types (i.e., $E \amalg D^{RT}$ and $E \amalg S^{RT}$, respectively)
because the value of E is, by definition, determined
randomly. Note that $E \amalg D^{RT}$ implies an assumption of
(full) exchangeability. Trivially, observed exposure status
is also independent of D response types given C, i.e.,
$E \amalg D^{RT} \mid C$, thus implying the assumption of (full)
conditional exchangeability. Importantly, even when adjusting
for C, marginal randomization of E does not result in
independence between E response types and D response types due to 2
open paths, $E^{RT} \leftarrow U1 \to D^{RT}$ and $E^{RT} \leftarrow U2 \to D^{RT}$. If we
adjust for U1 and U2, they become independent. Finally,
Figure 6 also clearly shows that selection bias results in
violation of $E \amalg D^{RT}$; when some of the subjects are lost
to follow-up, 7 marginally blocked paths between E and
$D^{RT}$ (i.e., $E \to S \leftarrow D \leftarrow D^{RT}$,
$E \to S \leftarrow S^{RT} \leftarrow U1 \to D^{RT}$,
$E \to S \leftarrow S^{RT} \leftarrow U2 \to D^{RT}$,
$E \to S \leftarrow S^{RT} \leftarrow U3 \to D^{RT}$,
$E \to D \to S \leftarrow S^{RT} \leftarrow U1 \to D^{RT}$,
$E \to D \to S \leftarrow S^{RT} \leftarrow U2 \to D^{RT}$, and
$E \to D \to S \leftarrow S^{RT} \leftarrow U3 \to D^{RT}$) become open because we
condition on the collider S. Indeed, extended DAGs are
of great use to demonstrate that both confounding bias
and selection bias result in lack of (full) exchangeability
of the exposed and unexposed groups.

Meanwhile, when using stratified randomization of E
by C, researchers physically prevent E from varying in
response to variations in E response types, but the
probability of E may vary across the strata of C. Thus, unlike
the marginal randomization of E, of the 2 arrows
pointing to E in Figure 5, only the arrow from $E^{RT}$ to E is
removed (Figure 7). As a result, there is an open path
between E and $E^{RT}$, i.e., $E \leftarrow C \leftarrow U1 \to E^{RT}$, which can be
blocked by adjusting for C (i.e., $E \amalg E^{RT} \mid C$). Further,
although there is an open path between E and $D^{RT}$, i.e.,
$E \leftarrow C \leftarrow U1 \to D^{RT}$, this path can be blocked by
adjusting for C (i.e., $E \amalg D^{RT} \mid C$), which implies that the
assumption of (full) conditional exchangeability can be
readily met in stratified randomization of E. Finally,
there is an open path between E and $S^{RT}$, i.e.,
$E \leftarrow C \leftarrow U1 \to S^{RT}$, which can be also blocked by adjusting for C
(i.e., $E \amalg S^{RT} \mid C$). Note that, like Figure 6, even when
adjusting for C, stratified randomization of E does not
result in independence between E response types and D
response types, and we need to additionally adjust for
U1 and U2.

Figure 6 An extended causal diagram depicting a hypothetical
situation under marginal randomization.

Figure 8 shows the situation in observational studies, in
which researchers stratify by C to calculate a weighted average
of stratum-specific association measures. (A square
around C means that we condition on C.) Unlike marginal
or stratified randomization of E, observed exposure status
is determined in response to E response types as well as the
status of C. Therefore, no arrows pointing to E are removed
in Figure 8. Note that, in Figure 8, E and $D^{RT}$ would be
marginally connected via the following 3 paths, i.e.,
$E \leftarrow E^{RT} \leftarrow U1 \to D^{RT}$, $E \leftarrow E^{RT} \leftarrow U2 \to D^{RT}$, and
$E \leftarrow C \leftarrow U1 \to D^{RT}$.
When we condition on C, only the third path can be
blocked, and E and $D^{RT}$ remain connected via the first 2
paths. Notably, the 3 paths can be theoretically blocked by
conditioning on U1 and U2. In other words, a sufficient
condition of $E \amalg D^{RT} \mid C$ is to adjust for U1 and U2 in
observational studies. Meanwhile, in equation 8, we demonstrate
that the condition $E^{RT} \amalg D^{RT} \mid C$ is a sufficient condition to
estimate effect measures in observational studies without loss
to follow-up. Indeed, $E^{RT}$ and $D^{RT}$ are not independent given
C in Figure 8, and they are connected via the following 2
paths, conditional on C, i.e., $E^{RT} \leftarrow U1 \to D^{RT}$ and
$E^{RT} \leftarrow U2 \to D^{RT}$. Note that both paths can be theoretically blocked
by conditioning on U1 and U2. To summarize, although
$E^{RT} \amalg D^{RT} \mid C$ and $E \amalg D^{RT} \mid C$ are sufficient conditions to
estimate effect measures in observational studies, neither is
guaranteed in observational studies, and expert knowledge
is required. In particular, although $E^{RT} \amalg D^{RT} \mid C$ is stronger
than $E \amalg D^{RT} \mid C$, we need to adjust for U1 and U2 to achieve
either condition as shown in Figure 8.

Figure 7 An extended causal diagram depicting a hypothetical
situation under stratified randomization.

Finally, it is worthwhile to mention that the perspectives
of the extended DAGs are different from those of the twin
network method, which has been developed to deal with
counterfactual values in DAGs [2]. This graphical method
uses two networks, one to represent the actual world and
one to represent the hypothetical world. Thus, this method
is used to represent the causal relations under intervention.
The aim of our extended DAGs is to integrate response
types and observed variables, which is thus applicable
to observational studies as well as randomized controlled
trials. As a consequence, we can use the extended
DAGs to describe the sufficient conditions to
infer causality in observational studies in terms of
response types.

Discussion 

Figure 8 An extended causal diagram depicting a hypothetical
situation in observational studies.

We have clarified the concepts of bias and effect modification
in the counterfactual framework, by describing theoretical
data frequencies from observational studies and
randomized controlled trials in terms of response types.
Although these concepts have been extensively explained in
the epidemiologic literature, most of the studies have
discussed them separately. In this article, we have highlighted
the relations between these concepts, by discussing them
simultaneously. The present findings help clarify
the link between the assumptions for making causal
inferences in observational studies and the counterfactual
approach, demonstrating the inherent distinctions between
observational studies and randomized controlled trials. The
extension of DAGs using response types maintains the
integrity of the original DAGs, which allows one to understand
the underlying causal structure discussed in this study.

We have shown a hitherto unrecognized sufficient condition,
$E^{RT} \amalg D^{RT} \mid C$, to estimate effect measures in observational
studies by adjusting for confounding bias. This condition is
stronger than the assumption of (full) conditional
exchangeability, and it is not straightforward to discuss technical
advantages of the hitherto unrecognized condition. Such
consideration, however, would enable one to further understand
the conceptual link between unobservable response
types and observed, or observable, data frequencies in the
population. This would also facilitate understanding of the
underlying causal structures of bias and effect modification.

In this article, we use a simple hypothetical situation,
including only 4 binary variables. Thus, it should be
noted that the present study does not encompass more
complicated situations, e.g., M-bias [51]. It is, however,
worthwhile to mention that the condition $E^{RT} \amalg D^{RT} \mid C$ is
applicable even when an exposure and an outcome are
polytomous variables, because our discussion based on
the extended DAGs does not restrict the type of variables.
When considering situations in which there are
several confounders, the present finding would apply by
defining and estimating a function of measured confounders
that can be treated as a single confounder. It should be
also noted that we focused only on direct effect modification,
and thus, the present discussion does not necessarily
apply to other types of effect modification, i.e., indirect
effect modification, effect modification by proxy, and effect
modification by a common cause [22]. Further, this study
does not address the issue of information bias or measurement
error. Recent studies have discussed how DAGs can
be used to represent them [52-55], which should be
addressed further in future studies.

Conclusion 

As shown in the present study, researchers should 
recognize inherent limitations of observational studies 
in estimating causal effects. It should be emphasized, 
however, that the recognition should come in the inter- 
pretation of the evidence when trying to draw conclu- 
sions, not in the statement of research goals or study 
design and conduct phases [56]. The data from observational
studies yield measures of association, and those
who examine the data should strive to impose a meaning
based on their expert knowledge on each occasion,
which would improve causal interpretations.